Template-based spectral estimation using microphone array for speech recognition
نویسندگان
چکیده
This paper proposes a Template-based Spectral Estimation (TSE) method for noise reduction of microphone array processing aiming at speech recognition enhancement. In the proposed method, a noise template in a complex plane is calculated for each frequency bin using non-speech audio signals observed at microphones. Then for every noise-overlapped speech signals, a speech signal can be reformed by applying the template and the gradient descent method. Experiments were conducted to evaluate not only performance of noise reduction but also improvement of speech recognition. Then NRR 16.7dB improvement was achieved by combining TSE and Spectral Subtraction (SS) methods. For speech recognition, 44% relative recognition error reduction was obtained comparing with the conventional SS method.
منابع مشابه
Generalized multi-microphone spectral amplitude estimation based on correlated noise model
Enhancing speech contaminated by uncorrelated additive noise, when the degraded speech alone is available, has received much attention. In recent years many systems have used multi-microphone arrays for the task of speech enhancement and robust speech recognition. In this paper we introduce a generalized multi-microphone spectral amplitude estimation approach based on a model with non-negligibl...
متن کاملTwo-Microphone Noise Reduction Using Spatial Information-Based Spectral Amplitude Estimation
Traditional two-microphone noise reduction algorithms to deal with highly nonstationary directional noises generally use the direction of arrival or phase difference information. The performance of these algorithms deteriorate when diffuse noises coexist with nonstationary directional noises in realistic adverse environments. In this paper, we present a two-channel noise reduction algorithm usi...
متن کاملAutomatic Speech Recognition of Human-Symbiotic Robot EMIEW
Automatic Speech Recognition (ASR) is an essential function of robots which live in the human world. Many works for ASR have been done for a long time. As a result, computers can recognize human speech well under silent environments. However, accuracy of ASR is greatly degraded under noisy environments. Therefore, noise reduction techniques for ASR are strongly desired. Many approaches based on...
متن کاملCMSC 660 Project Solutions Optimization methods for Sound Source Localization using Microphone arrays
Microphone arrays are widely employed for applications like teleconferencing, high quality sound capture, speaker recognition/identification, acoustic surveillance, head aid devices, speech acquisition in automobile environments etc. For all these applications the benefits that a microphone array provides over a single microphone are two fold. First using a microphone array we can localize a so...
متن کاملMicrophone array post-filter based on noise field coherence
This paper introduces a novel technique for estimating the signal power spectral density to be used in the transfer function of a microphone array post-filter. The technique is a generalization of the existing Zelinski post-filter, which uses the autoand cross-spectral densities of the array inputs to estimate the signal and noise spectral densities. The Zelinski technique, however, assumes zer...
متن کامل